Goto

Collaborating Authors

 Lashkar Gah


DNA-DetectLLM: Unveiling AI-Generated Text via a DNA-Inspired Mutation-Repair Paradigm

Zhu, Xiaowei, Ren, Yubing, Fang, Fang, Tan, Qingfeng, Wang, Shi, Cao, Yanan

arXiv.org Artificial Intelligence

The rapid advancement of large language models (LLMs) has blurred the line between AI-generated and human-written text. This progress brings societal risks such as misinformation, authorship ambiguity, and intellectual property concerns, highlighting the urgent need for reliable AI-generated text detection methods. However, recent advances in generative language modeling have resulted in significant overlap between the feature distributions of human-written and AI-generated text, blurring classification boundaries and making accurate detection increasingly challenging. To address the above challenges, we propose a DNA-inspired perspective, leveraging a repair-based process to directly and interpretably capture the intrinsic differences between human-written and AI-generated text. Building on this perspective, we introduce DNA-DetectLLM, a zero-shot detection method for distinguishing AI-generated and human-written text. The method constructs an ideal AI-generated sequence for each input, iteratively repairs non-optimal tokens, and quantifies the cumulative repair effort as an interpretable detection signal. Empirical evaluations demonstrate that our method achieves state-of-the-art detection performance and exhibits strong robustness against various adversarial attacks and input lengths. Specifically, DNA-DetectLLM achieves relative improvements of 5.55% in AUROC and 2.08% in F1 score across multiple public benchmark datasets. Code and data are available at https://github.com/Xiaoweizhu57/DNA-DetectLLM.


Hidden Pentagon records reveal patterns of failure in deadly U.S. airstrikes

The Japan Times

Shortly before 3 a.m. on July 19, 2016, U.S. Special Operations forces bombed what they believed were three Islamic State (IS) group "staging areas" on the outskirts of Tokhar, a riverside hamlet in northern Syria. They reported 85 fighters killed. In fact, they hit houses far from the front line, where farmers, their families and other local people sought nighttime sanctuary from bombing and gunfire. More than 120 villagers were killed. In early 2017 in Iraq, an American war plane struck a dark-colored vehicle, believed to be a car bomb, stopped at an intersection in the Wadi Hajar neighborhood of West Mosul. Actually, the car had been bearing not a bomb but a man named Majid Mahmoud Ahmed, his wife and their two children, who were fleeing the fighting nearby. They and three other civilians were killed. In November 2015, after observing a man dragging an "unknown heavy object" into an IS "defensive fighting position," U.S. forces struck a building in Ramadi, Iraq. A military review found that the object was actually "a person of small stature" -- a child -- who died in the strike. None of these deadly failures resulted in a finding of wrongdoing. These cases are drawn from a hidden Pentagon archive of the American air war in the Middle East since 2014. The trove of documents -- the military's own confidential assessments of more than 1,300 reports of civilian casualties, obtained by The New York Times -- lays bare how the air war has been marked by deeply flawed intelligence, rushed and often imprecise targeting and the deaths of thousands of civilians, many of them children, a sharp contrast to the U.S. government's image of war waged by all-seeing drones and precision bombs. The documents show, too, that despite the Pentagon's highly codified system for examining civilian casualties, pledges of transparency and accountability have given way to opacity and impunity. In only a handful of cases were the assessments made public. Not a single record provided includes a finding of wrongdoing or disciplinary action. Fewer than a dozen condolence payments were made, even though many survivors were left with disabilities requiring expensive medical care. Documented efforts to identify root causes or lessons learned are rare. The air campaign represents a fundamental transformation of warfare that took shape in the final years of the Obama administration, amid the deepening unpopularity of the forever wars that had claimed more than 6,000 American service members. The United States traded many of its boots on the ground for an arsenal of aircraft directed by controllers sitting at computers, often thousands of kilometers away. President Barack Obama called it "the most precise air campaign in history." This was the promise: America's "extraordinary technology" would allow the military to kill the right people while taking the greatest possible care not to harm the wrong ones. The IS caliphate ultimately crumbled under the weight of American bombing.


Abductive Inference for Combat: Using SCARE-S2 to Find High-Value Targets in Afghanistan

Shakarian, Paulo (U.S. Army) | Nagel, Mago (University of Maryland) | Schuetzle, Brittany (University of Maryland) | Subrahmanian, V.S. (University of Maryland)

AAAI Conferences

Recently, geospatial abduction was introduced by the authors in [Shakarian et. al. 2010] as a way to infer unobserved geographic phenomena from a set of known observations and constraints between the two. In this paper, we introduce the SCARE-S2 software tool which applies geospatial abduction to the environment of Afghanistan. Unlike previous work, where we looked for small weapon caches supporting local attacks, here we look for insurgent high-value targets (HVT's), supporting insurgent operations in two provinces. These HVT's include the locations of insurgent leaders and major supply depots. Applying this method of inference to Afghanistan introduces several practical issues not addressed in previous work. Namely, we are conducting inference in a much larger area (24,940 sq km as compared to 675 sq km in previous work), on more varied terrain, and must consider the influence of many local tribes. We address all of these problems and evaluate our software on 6 months of real-world counter-insurgency data. We show that we are able to abduce regions of a relatively small area (on average, under 100 sq km and each containing, on average, 4.8 villages) that are more dense with HVT's (35 X more than the overall area considered).